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ADDRESSABLE PROTEIN ARRAYS 

gacjcgroun d of the Invention 

The invention relates to fixed airays of nucleic acid-protein fusions 
and, in particular, RNA-protein fusions, on solid supports. 

Certain macromolecules, such as proteins, are known to interact 
specifically with other molecules based on the three-dimensional shapes and 
electronic distributions of those molecules. For example, proteins interact 
selectively with other proteins, with nucleic acids, and with small-molecules. 
Modern pharmaceutical research relies on the study of these interactions; the 
development of new drugs depends on the discovery of compounds that bind 
specifically to biologically important molecules. 

The discovery of a single drug candidate can require the screening of 
thousands of compounds. It is therefore important to be able to screen large 
numbers of compounds rapidly and efficiently. One method for screening a 
large number of 'compounds is to fix possiblabinding partners, such as proteins, 
to a solid support. 

It is difficult to prepare arrays of isolated proteins on solid supports, 
however, for a variety of reasons. First of all, proteins cannot always be easily 
attached to the planar surfaces traditionally used to make other fixed arrays, 
such as nucleic acid microchips. More importantly, because proteins can 
interact with the functional groups on the surfaces of these supports, the 
proximity of the protein to the surface can lead to disruption of the protein 
structure. 
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Summary of the Invention 
In general, the invention features a solid support including an array 
of immobilized capture probes; each of the capture probes includes a non- 
nucleosidic spacer group and an oligonucleotide sequence to which a nucleic 
acid-protein fusion is bound (for example, hybridized or covalently bound). In 
preferred embodiments^tKe nucleic acid-protein fusion is an RNA-protein 
fusion, and the protein component is encoded by the nucleic acid (for example, 
the RNA). The spacer group can include a polyalkylene oxide, for example, 
polyethylene; oxide. A preferred spacer group includes hexaethylene oxide. 
The capture probe may also include a photocleavable linker. 

The oligonucleotide sequence can include a modified base, such as 
5-propyne pyrimidine. It can also include an internucleotide analog (such as 3 f - 
phosphoramidate) or a carbohydrate modification (such as a 2-O-methyl 
group). The nucleic acid-protein fusion can include a hybridization tag 
sequence. The hybridization tag sequence can also include a modified base, an 
internucleotide analog, or a carbohydrate modification. 

In a preferred embodiment, the capture probe further includes a 
reactive moiety (for example, a nucleophilic group), such as a primary amino 
group. In another preferred embodiment, the nucleic acid-protein fusion is 
covalently linked to the capture probe (for example, by photo-crosslinking); in 
one preferred approach, *thfs is accomplished by including one or more psoralen 
moieties in the capture pirobe or in the capture probe- fusion hybridization 
reaction mixture. A preferred solid support is a glass or silica-based chip. 

In a related aspect, the invention features a solid support including an 
array of immobilized capture probes; each of the capture probes is attached to 
the surface of the solid support through a non-nucleosidic spacer group, and 
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each of the capture probes includes an oligonucleotide sequence to which a 
nucleic acid-protein fusion (for example, an RNA-protein fusion) is bound (for 
example, hybridized or covalently bound). 

In another related aspect, the invention features a solid support 
including an array of immobilized capture probes; each of the capture probes 
includes a non-nucleosidic spacer group and an oligonucleotide sequence to 
which a ribosome display particle is bound (for example, hybridized or 
covalently bound). 

In yet another related aspect, the invention features a method for 
preparing a solid support., The method includes the steps of: (a) preparing a 
capture probe by linking a spacer group to an oligonucleotide sequence; (b) 
attaching the capture probe to the solid support; and (c) binding (for example, 
hybridizing or covalently binding) a nucleic acid-protein fusion (for example, 
an RNA-protein fusion) to the capture probe. 

The invention also features a second general method for preparing a 
solid support. This method includes the steps of: (a) attaching a spacer group 
to a surface, of the solid support; (b) attaching a Afunctional linker to the spacer 
group; (c) attaching a capture probe to the bifimctional linker; and (d) binding 
(for example, hybridizing or covalently binding)„a nucleic acid-protein fusion 
(for example, an RNA-protein fusion) to the capture probe. 

In a second aspect, the invention features a method for detecting an 
interaction between ^_J^o^^j^d^ajm^qtm± The method includes the steps 
of: (a) providing a solid support including an array of immobilized capture 
probes, where each of the capture probes includes a non-nucleosidic spacer 
group and an oligonucleotide sequence to which a nucleic acid-protein fusion is 
bound (for example, hybridized or covalently bound); (b) contacting the solid 
support with a candidate compound under conditions which allow an 
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interaction between the protein portion of the nucleic acid-protein fusion and 
the compound; and (c) analyzing the solid support for the presence of the 
compound as an indication of an interaction between the protein and the 
compound. j 
5 Alternatively, the invention features another method for detecting an 

interaction between a protein and a compound; this method involves the steps 
of: (a) providing a population of nucleic acid-prdtein fusions; (b) contacting the 
population of nucleic acid-protein fusions with a candidate compound under 
conditions which allow an interaction between the protein portion of the 

10 nucleic acid-protein fusion and the compound; (c) contacting the product of 
step (b) with a solid support that includes an array of immobilized capture 
probes, each of the capture probes including a non-nucleosidic spacer group 
and an oligonucleotide sequence to which a nucleic acid-protein fusion binds 
(for example, hybridize^ or covalently binds); and (d) analyzing the solid 

1 5 support for the presence of the compound as an indication of an interaction 
between the protein and the compound. 

In a preferred embodiment of each of the above methods, the nucleic 
acid-protein fusion is an RNA-protein fusion. In another preferred 
embodiment, the compound is labeled. Compounds that can be screened using 

20 these methods include, without limitation, proteins, drugs, therapeutics, 
enzymes, and nucleic acids. 

In a third aspect^ the invention features an array (for example, an ~ 
addressable array) of nucleic acid-protein fusions including at least 10 2 
different fusions/cm 2 . Preferably, the nucleic acid-protein fusions are RNA- 

25 protein fusions, and the array includes at least 10 4 different fusions/cm 2 . 

In a related aspect, the invention features a method for generating an 
addressable array of molecules. The method involves: (a) providing a solid 
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support on which an array of nucleic acid molecules is immobilized; (b) 
contacting the solid support with a population of addressable molecules; and (c) 
allowing the addressable molecules to orient themselves on the solid support by 
sequence-dependent recognition and binding of the immobilized nucleic acid 
molecules. 

In preferred embodiments of this method, the addressable array of 
molecules is an array of nucleic acid-protein fusions (for example, an array of 
RNA-protein fusions); the addressable molecules orient themselves on the solid 
support by base pairing (for example, hybridization) with the immobilized 
nucleic acid molecules; the solid support is a glass or silica-based chip; and the 
nucleic acid molecules immobilized on the solid support are capture probes, 
each including a non-nucleosidic spacer group and an oligonucleotide sequence 
to which the addressable molecule binds. 

As used herein, by an "array" is meant a fixed pattern of 
immobilized objects on a solid surface or membrane. Typically, the array is 
made up of nucleic acid-protein fusion molecules (for example, RNA-protein 
fusion molecules) bound to capture nucleic acid sequences which themselves 
are immobilized on the solid surface or membrane, pie array preferably 
includes at least 10 2 , more preferably at least 10 3 , and%iost preferably at least 
10 4 different fusions, and these fusions are preferably arrayed on a 125 x 80 
mm, and more preferably on a 10 x 10 mm, surface. By an "addressable array" 
is meant that the locations,, or addresses, on the solid support of the members of 
the array (for example, the nucleic acid-protein fusions) are known; the 
members of the array are referred to as "addressable molecules" and are 
utilized in methods for screening for subsequent molecular interactions (for 
example, for screening for interactions between the addressable nucleic acid- 
protein fusions and candidate therapeutics). 
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By "nucleic acid-protein fusion" is meant a nucleic acid covalently 
bound to a protein. By "nucleic acid" is meant any two or more covalently 
bonded nucleotides or nucleotide analogs or derivatives. As used herein, this 
term includes, without limitation, DNA, RNA, and PNA. By "protein" is 
meant any two or more amino acids, or amino acid analogs or derivatives, 
joined by peptide or peptoid bond(s), regardless of length or post-translational 
modification. As used herein, this term includes, without limitation, proteins, 
peptides, and polypeptides. 

By "hybridizatioii tag" is meant a non-coding oligonucleotide 
sequence that differs sufficiently in sequence from other nucleic acid sequences 
in a given population or reaction mixture that significant cross-hybridization 
does not occur. When multiple hybridization tags are utilized in a sirigle 
reaction mixture, these tags also preferably differ in sequence from one another 
such that each has a unique binding partner under the conditions employed. 

By a "population" is meant more than one molecule. 

By a "solid support" is meant any solid surface including, without 

. * 

limitation, any chip (for example, silica-based, glass, or gold chip), glass slide, 
membrane, bead, solid particle (for example, agarose, sepharose, or magnetic 
bead), column (or column material), test tube, or microtiter dish. 

Brief De scription of the Drawings 
Figure 1 is a drawingshowing the silylation of a glass surface* the 

derivatization of the resulting amino groups, and the attachment of a capture 

probe to the modified surface. 

Figure 2 is a drawing illustrating a capture probe containing a non- 

nucleosidic spacer group and a reactive moiety. 

Figure 3 is a schematic diagram of the layout of the FLAG and 
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HA1 1 fusion chip capture probes utilized in Figures 4 and 5. In this Figure, t7, 
tag, aul, au5, flag, hal, irs 5 and kt3 represent the capture probes CPt7 (positive 
control), CPtag (positive control), CPaul (negative control), CPau5 (negative 
control), CPflag, CPhal 1, CPirs (negative control), and CPkt3 (negative 
control), respectively. 

Figure 4 is a phosphorimage demonstrating hybridization of nucleic 
acid-protein fusions (FLAG and HA1 1) to capture probes immobilized on a 
chip. 

Figure 5 is a fluorimage demonstrating hybridization of nucleic acid- 
protein fusions (FLAG and HA1 1) to capture probes immobilized on a chip and 
subsequent recognition with anti-HAl 1 monoclonal antibodies. 

Figure 6 is a schematic diagram of the layout of the Myc fusion chip 
capture probes utilized in Figures 7 and 8. In this Figure, capture probes CP01, 
CP33, CP80, CP 125, CPmm, and CPns (described herein) were arranged on the 
chip as follows: CP01 at locations Al, Bl, CI, A4, B4, and C4; CP33 at 
locations Dl, El, Fl, D4, E4, and F4; CP80 at locations A2, B2, C2, A5, B5, 
and C5; CP125 at locations D2, E2, F2, D5, E5, and F5; CPmm at locations 
A3, B3, C3, A6, B6, and C6; and CPns at locations D3, E3, F3, D6, E6, and F6. 

Figure 7 is a phosphorimage demonstrating^hybridization of nucleic 
acid-protein fusions (Myc) to capture probes immobilized on a chip. 

Figure 8 is a fluorimage demonstrating hybridization of nucleic acid- 
protein fusions (Myc) to capture probes immobilized on a chip and subsequent 
recognition with anti-Myc monoclonal antibodies. 

Description of the Preferred Embodiments 
The invention features support-based, addressable arrays of proteins, 
and methods for preparing and using these arrays. The arrays are prepared by 
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fixing oligonucleotide sequences, the capture probes (or capture oligos), to a 
support in a defined array. The capture probes are then used to bind nucleic 
acid-protein fusions, such as RNA-protein fusions. Such binding may occur 
through base pairing (for example, through Watson-Crick base pairing, pseudo 
5 Watson-Crick base pairing involving modified bases, or Hoogsteen base 
pairing) between the nucleic acid component of the fusion and a 
complementary capture probe, or may occur through any other type of 
sequence-dependent recognition and binding of the capture probe (including, 
without limitation, poly ami de-mediated nucleic acid groove binding or specific 

1 0 binding by nucleic acid-binding proteins such as transcription factors). The 
result of the binding interactions between the fusions and the capture probes is 
a defined, addressable array of proteins attached to a solid support. 

A variety of materials can be used as the solid support. Examples of 
such materials include polymers (e.g., plastics), aminated surfaces, gold coated 

1 5 surfaces, nylon membranes, polyacrylamide pads deposited on solid surfaces, 
silicon, silicon-glass (e.g., microchips), silicon wafers, and glass (e.g., 
microscope slides). Microchips, and particularly glass microchips, represent a 
preferred solid support surface. 

If the surface is not already aminated, it can be modified to provide a 

20 layer of amino groups. For example, a glass microscope slide can be treated 
with a silylating agent such as trialkoxyaminosilane to provide a surface of 
primary amino groups thatexists as a monolayer or 3-8 molecular layers. This 
reaction is illustrated in Figure 1 . The silane-treated surface is then derivatized 
with a homobifunctional or heterobifunctional linker that permits the 

25 attachment of oligonucleotides at discrete positions. Phenylene 

1,4-diisofhiocyanate is a useful homobifunctional linker; amino-surfaces 
derivatized with this reagent have isothiocyanate functionalities that are 
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available to covalently react with the primary amino groups on the termini of 
oligonucleotides to form stable thiourea bonds, as shown in Figure 1 . 

The capture probes, i.e., the oligonucleotide sequences that are to be 
attached to the surface, are selected from the reverse-complements of the 
nucleic acid components of the nucleic acid-protein fusions (the targets). 
Capture probes preferably have between 5 and 30 nucleotide units, and more 
preferably have about 20 nucleotide units. Considerations for the selection of 
the exact sequence for a particular capture probe include melting temperature 
(Tm), interference from competing target sequences, and potential secondary, 
structure in the target sequence. Ideally, each unique capture probe has the 
same Tm, i.e., they are isoenergetic, so a single hybridization and washing 
temperature can be used successfully for all capture-target pairs. Commercially 
available computer programs (e.g., Oligo 4.0) can be used to help identify sets 
of capture probes with similar thermodynamic properties based on nearest 
neighbor treatments. 

The capture probes are modified before they are attached to the 
surface. One or more non-nucleosidic spacers, such as polyethylene oxide, are 
added to the terminus of the oligo. Preferably, 1-20 spacers and, most 
preferably, 4 spacers are utilized. These spacers may'be added to either the 5 1 
or preferably the 3' end of the oligonucleotide. A nucleophilic moiety is then 
attached to the spacer group. The result is a derivatized capture probe, as 
shown in Figure 2. A preferred spacer monomer includes hexaethylene oxide. 

Non-nucleosidic spacers are preferred over nucleosidic spacers, such 
as poly-T, because non-nucleosidic spacers have greater flexibility. In 
addition, their physical properties can be tailored relatively easily, and it is 
possible to minimize specific and non-specific nucleic acid interactions. 

The spacers provide physical separation between the oligonucleotide 
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and the solid surface and prevent interaction of the proteins with the support 
surface. This separation is important to ensure effective hybridization between 
the support-bound capture probe and the nucleic acid-protein fusion. In 
addition, the separation helps to minimize denaturation of the protein; the 
5 proteins are therefore able to adopt their native folded structures and remain 
functional. 

Alternatively ,dhe spacer groups can be attached directly to the solid 
support surface, instead of to the capture probes. For example, the spacer 
group can be attached to the amino groups on the surface. The Afunctional 

1 0 linker can then be attached to the other end of the spacer group. 

In addition to spacer groups, the capture probes may contain 
modifications that improve their hybridization properties and mismatch 
discrimination. For example, they may contain base analogs, such as 
5-propyrie pyrimidines, internucleotide analogues such as peptide nucleic acids 

15 (PNA), in which the bases"are connected by peptide-like linkages, or 
carbohydrate modifications. 

The capture probes are suspended in an aqueous alkaline solution, 
then applied to defined positions of the support surface; the nucleophilic 
moieties at the termini of the capture probes react with the active sites of the 

20 Afunctional linkers to form covalent bonds. The density of the capture probes 
can be controlled by adjusting reaction time and oligo concentration. 

Alternatively, the density can be controlled by doping the solution with capture _ 
oligos that lack nucleophilic moieties or doping with simple organic 
compounds that possess amine functional groups. 
25 The capture probes can be applied using liquid deposition 

techniques, such as inkjet delivery, robotic quill delivery or spotting, and other 
similar deposition methods-. They can also be applied using manual methods, 
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such as pipetting. The feature sizes of the capture probes can range from one 
square micron (e.g., when robotic techniques are used ) to one square 
millimeter (e.g., when a 0.2 microliter pipette is used). The result of the 
application of the capture probes is a defined, regular array of nucleic acid 
5 sequences. 

After a sufficient reaction time, the excess capture probe is washed 
away, and the remaining unreacted isothiocyanate. groups are blocked off. 
Dilute ammonia can be used as the blocking agent, resulting in a surface of 
phenyl thiourea groups. Blocking agents can also be selected to modify the 

1 0 surface energy, i.e., the hydrophobicity of the solid support surface. The 

hydrophobicity of the solid support surface is important because it affects the - 
background signal level and the extent of unwanted interaction of the protein 
portion of a nucleic acid-protein fusion with the surface. Examples of blocking 
agents that modify hydrophobicity are methylamine, amino alcohols, and 

1 5 suitable amino-containing polyethylene oxide moieties. 

Non-covalent blocking agents can also be used to further minimize 
non-specific interactions between the fusion and the solid support (e.g., glass) 
surface. Examples of such blocking agents include' non-specific proteins such 
as BSA or casein, or similar commercially available locking reagent 

20 formulations marketed for use with membranes. 

The capture probes arrayed on the surface of the solid support are 
then bound (for example^ by hybridization) to nucleic acid-protein fusions, such 
as RNA-proteiri fusions. A solution containing the mixture of fusions is 
adjusted to an appropriate salt concentration, applied to the surface, and 

25 incubated at a suitable temperature to allow for efficient binding (for example, 
hybridization) between the capture probe and the target sequence. The solution 
may also contain surfactants such as TWEEN-20, TRITON X-100, or SDS 
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(Sigma Chemical Co.) at concentrations of about 0.02% to about 1.0%; it may 
also include non-specific proteins, such as BSA. 

The experimental variables of salt concentration, temperature, and 
hybridization time are a function of the capture oligo design. A preferred range 
5 for the salt concentration is 25 mM to 2 M, with a concentration of about 750 
mM being especially preferred. A preferred temperature range is from 5 °C to 
70 °C, with 30 °C being-especially preferred. Preferred reaction times can be 
from 1 to 24 hours, with 3 hours being especially preferred. The variables for 
each experiment are determined empirically by standard methods. The 

10 hybridization step can be performed in a simple chamber device that constrains 
the liquid sample and prevents evaporation. 

When RNA-protein fusions are utilized as addressable arrays, the 
solution may also contain one or more components to suppress nuclease 
degradation of the RNA moiety. Preferred additions include (a) metal chelators 

1 5 (e.g., EDTA or EGTA) U at concentrations of between 1-10 mM, (b) placental 
RNase inhibitor protein (Promega) at concentrations of between 0.1-1 Unit/jxl; 
and (c) Anti-RNase protein (Ambion) at concentrations of between 0.1-1 
Unit/|xl. A separate strategy to specifically suppress 5-exonuclease 
degradation involves capping the 5'-terminus of the fusion RNA with a binding 

20 molecule. The capping strategy may be used in conjunction with one or more 
of the components listed above. In one particular capping approach, a native or 
analog (e.g., PNA) nucleic acid sequence conplementary to the 5'-terminus of 
the fusion RNA is added to generate a stable duplex at the 5'-end. The 
complementary sequence is preferably between 10-50 bases in length, and 

25 most preferably abount 20 bases in length. This added nucleic acid sequence 
may also contain pendant groove-binding, intercalating, or cross-linking 
moieties. Alternatively, native or analog nucleic acid sequences may be added 
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that form stable intermolecular hairpin, tetraloop, or pseudoknot secondary 
structures with the 5-terminus of the RNA. In the latter case, these nucleic 
acids are preferably about 20 - 100 bases in length, with about 35 bases being 
especially preferred. 

To the extent possible, the mixture of nucleic acid-protein fusions 
should be free of un-fused nucleic acids. Un-fused nucleic acids that are 
complementary to the capture probes will compete with the fusions for binding 
and will limit the amount of a given protein that can be displayed on the solid 
support. Preferably, at least 1% of the nucleic acid (for example, the RNA 
message) is fused to protein. 

Unique non-coding regions can be incorporated into the nucleic acid 
component of the fusion for the specific purpose of being "captured" by the 
capture probe; these non-coding regions are referred to as "hybridization tag 
sequences." The hybridization tag sequences may include the same analogue 
units as are described above for the capture probes. In some cases, both the 
capture probe and the tag sequences can be modified so they hybridize 
preferentially with each other, thereby minimizing interference from the coding 
fusion sequences. . 4 * 

Upon completion of the binding step, unbound nucleic acid-protein 
fusion is washed away with a buffer that has a higher stringency and a lower 
salt concentration than that used for the hybridization step. Again, the optimal 
buffer composition is determined empirically by standard methods. What 
remains upon completion of washing is an addressable array of proteins on the 
solid surface, attached via sequence-dependent recognition between the nucleic 
acid component of the fusion and the surface-bound capture oligo. The 
position of each protein is defined, because each fusion corresponds to the 
complementary capture probe. 
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In addition, if desired, the nucleic acid component of the fusion may 
be covalently linked to a part of the solid support, the linker, or the capture 
probe. Such covalently linked fusions provide particularly robust and versatile 
addressable arrays that may be used, for example, in the screening methods 
5 described herein. Covalently linked fusion arrays may be generated by any 
standard approach. According to one general technique, the fusions are 
addressed to specific locations on a solid surface via hybridization with 
corresponding capture probes, and a chemical cross-linking or attachment 
reaction is triggered to fix the location of the fusions on the solid support. One 

1 0 method to achieve such a covalent link involves functionalizing the DNA 
capture oligos during chemical synthesis with one or more pendant psoralen 
moieties, preferably positioned near adenosine bases. . After hybridizing the 
nucleic acid-protein fusion (for example, the RNA-protein fusion) to the 
support-bound capture oligos, the surface is exposed to long-wavelength UV 

1 5 light (for example, at 350 am). Light of this wavelength triggers a 

photoreaction between psoralen and an adjacent thymidine or uridine base in 
the duplex region, forming .a' cyclobutane linkage and permanently attaching 
the fusion to the solid support. Alternatively, psoralen itself (i.e., not linked to 
a capture probe) may be included in the hybridization solution or in a 

20 subsequent separate solution. The psoralen molecule intercalates between 

bases in double-stranded regions. Upon irradiation with long-wavelength UV 
light, the intercalated psoralen cross-links with thymidine or uridine bases 
(intrastrand and interstrand) in a Afunctional mode, forming covalent links 
between the capture probe and the nucleic acid component of the fusion. Other 

25 reactive, cross-linking reagents may also be used in place of psoralen in 
combination with triggering conditions appropriate for those reagents. 

Ordered, addressable arrays of peptide fragments can also be _ _ 
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prepared. To prepare these arrays, the fusion library is generated from short 
synthetic DNA sequences or fragments of cDNAs or genomic DNAs. In 
another variation, ribosome display particles, such as those described in Gold et 
aL, WO 93/03 1 72, can be hybridized to the solid support to generate the protein 
5 array. Again, these particles are immobilized on the solid support through a 
hybridization reaction between the capture oligo and the protein-coding RNA. 

Use 

The addressable protein arrays of the present invention have many 
uses. For example, a library of proteins can be displayed on a support, such as 

10 a microchip. This microchip can then be used to identify previously unknown 
protein-protein interactions. A probe protein can be detectably labeled, for 
example, with a radioisotope, chrqmophore, fluorophore, or chemiluminescent 
species, then incubated with the microchip. After the excess probe protein is 
washed away, the chip surface is analyzed for signal from the label. Detection 

15 of a signal indicates interaction of the labeled protein with one or more unique 
members of the protein library. The identity of proteins that are able to bind to 
the probe protein can then be determined from the location of the spots on the 
chip that become labeled due to binding of the probe. The same approach can 
also be used to screen protein libraries for protein-ligand interactions and 

20 protein-nucleic acid interactions. 

Other methods can be used to detect protein-protein, protein-ligand, 
or protein-nucleic acid interactions. For example, when the solid surface used 
to form the protein array is a gold layer, surface plasmon resonance (SPR) can 
be used to detect mass changes at the surface. When gold surfaces are 

25 employed, the reactive moiety on the oligonucleotide capture probe is a thiol 
group (rather than an amino group) and the gold surface need not be 
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functionalized to achieve capture probe attachment. Mass spectrometry 
(especially, Maldi-Tof) can also be used to analyze species bound to unique 
members of the protein library. 

Another application of protein arrays is the rapid determination of 
5 proteins that are chemically modified through the action of modifying enzymes 
such as protein kinases, acyl transferases, and methyl transferases. By 
incubating the protein array with the enzyme of interest and a radioactively 
labeled substrate, followed by washing and autoradiography, the location and 
hence the identity of those proteins that are substrates for the modifying 

1 0 enzyme may be readily determined. Further localization of the modification 
sites can be achieved using ordered displays of fragments of these proteins. 

The protein arrays can also be used to identify the unknown protein 
targets of therapeutically active compounds. For example, a therapeutic 
compound may be applied to a protein array derived from cellular RNA. 

1 5 Detection of the captured therapeutic compound, either through its bound label 
or directly (for example, by mass spectrometry or surface plasmon resonance) 
reveals the compound's binding partner or partners. In addition, arrays can also 
be used in the development of protein-based diagnostics. For example, a solid 
support containing a variety of proteins associated with various illnesses can be 

20 prepared. A single patient sample, which might contain one or more proteins 
whose interactions with the support-bound proteins would be indicative of 
certain illnesses, can then be contacted with the support. Thus, a single sample 
can be used to simultaneously detect the presence of several conditions, or to 
distinguish between conditions. Alternatively, addressable arrays may be used 

25 to quantify target molecules in a sample. In one particular example, 

addressable arrays of single chain antibodies or antibody mimics may be used 
for quantifying a target protein (or proteins) in a biological sample. The arrays 
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can also be used in the emerging fields of proteomics and functional genomics. 

The specific fusions that are identified as binding specifically to a 
probe molecule can be removed from the support surface. In one method, the 
fusion is released by disrupting hybridization with the capture probe. In one 
particular approach, the specified fusion is physically separated from the rest of 
the fusions, then treated with a denaturing agent, such as a chemical reagent or 
heat, to disrupt the base pairing with the capture oligo. The liberated fusion is 
then recovered from the solution. 

Alternatively, the entire capture probe can be detached. During solid 
support preparation, a light-sensitive linker can be used to attach the capture 
probe to the solid surface. Following identification of the active fusion, a laser - 
beam of the appropriate wavelength can be used to cleave the linker, thus 
releasing the desired fusion. Following release from the surface by any of the 
above methods, the fusion can be specifically recovered and manipulated, for 
example, using PCR, and further characterized. 

There now follow particular examples of the preparation of protein 

arrays according to the invention. These examples are provided for the puipose 

v 

of illustrating the invention, and should not be construed as limiting. 

Example 1: Silvlation of a Glass Surface 

Select grade, low-iron content, pre-cleaned 1 x 3 inch glass 
microscope slides (VWR Scientific) are prepared by heating with 1 M 
hydrochloric or nitric acid for 30 minutes at 70 °C. The slides are then 
subjected to three 5-minutes washes, using fresh distilled water for each wash. 
A 1% solution of aminopropyltrimethoxysilane (Gelest, Inc.) in 95% 
acetone/5% water is prepared and allowed to hydrolyzeibr at least five 
minutes. The glass slides are immersed in the hydrolyzed silane solution for 



10 
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2-20 minutes with gentle agitation. Excess silane is removed by subjecting the 
slides to ten 5 -minute washes, using fresh portions of 95% acetone/5% water 
for each wash, using gentle agitation. The slides are then cured by heating at 
1 1 0 °C for 20-45 minutes. 

Example 2j Derivatiz ^tinn with a Homobifunctional Linker 

Silane treated.slides from Example 1 are immersed in a freshly 
prepared 0.2% solution of phenylene 1 ,4-diisothiocyanate (Aldrich Chemical 
Co.) in 90% DMF/1 0% pyridine for two hours, with gentle agitation. The 
slides are washed sequentially with 90% DMF/1 0% pyridine, methanol, and 
acetone. After air drying, the functionalized slides are stored at 0°C in a 
vacuum desiccator over anhydrous calcium sulfate. 



F.xample 3: Synthesis rapture Probes 

Oligonucleotides are chemically synthesized in the 3'-*5' direction 
by coupling standard phosphoramidite monomers with an automated DNA 

15 synthesizer. Typically, 500 angstrom controlled-pore glass supports are used at 
the 0.2 micromole scale. After the desired probe sequence has been assembled 
(using A, G, C, and T monomers), hexaethylene oxide phosphoramidite 
monomer (Glen Research) is added to the 5' terminus, the coupling wait time 
is extended to 15 minutes by modifying the synthesizer program. Additional 

20 hexaethylene oxide monomer units are added in the same way. C-6 Amino 

phosphoramidite (Glen Research) is then added to the 5 1 terminus; the coupling 
wait time is again extended to 15 minutes. The acetic anhydride capping step 
and the final acidic detritylation step are eliminated. Capture probe sequences 
are cleaved from the solid support and deprotected with ammonium hydroxide, 

25 concentrated to dryness, precipitated in ethanol, and purified by reverse-phase 
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HPLC using an acetonitrile gradient in triethylammonium acetate buffer. 

Example 4: Attachment of Capture Probes 

The purified, amine-labeled capture probes from Example 3 are 
adjusted to a concentration of 500 micromplar in 100 mM sodium carbonate 
5 buffer (pH 9.0), and are applied to the deriyatized glass surfatce from Example 
2 at defined positions. For manual deposition, aliquots of 0.2 microliter each 
arc applied with a pipetman. ; The array is incubated at room temperature in a 
moisture-saturated environment for at least two hours. The attachment reaction 
is terminated by immersing the glass surface in an aqueous 1% ammonia 
1 0 solution for five minutes with gentle agitation. The glass surface is then 

subjected to three 5-minute washes, using fresh portions of distilled water for 
each wash. The array is. then soaked in 1 M phosphate buffered saline (PBS) 
solution for 2 hours at room temperature, then rinsed again for 5 minutes in 
distilled water. 

15 Example 5: Surface Modification 

v 

The ammonia solution from Example 4 is replaced with a 1-5% 
aqueous solution of a different primary amine-containing molecule. A small 
amount (10%) of methanol or acetonitrile cosolvent is added, if necessary. 

The glass surface is then subjected to three 5-minute washes, using 

20 fresh portions of distilled water for each wash. The surface is soaked in 1 M 
phosphate buffered saline (PBS) solution for 2 hours, then washed again for 5 
minutes with distilled water. The glass surface is immersed in a dilute, aqueous 
solution of a protein-containing blocking solution for several minutes, then 
subjected to three 5-minute washes, using fresh portions of distilled water for 

25 each wash. Finally, the surface is air dried. 
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Example 6: Fusion Hybridization 

50 microliters of a solution containing the RNA-protein fusions and 
consisting of 25 mM Tris-HCl (pH 8.0) and 100 mM potassium chloride are 
applied to the glass microchip surface in a chamber that can contain and seal 
5 the liquid. The solution is maintained at a specific temperature (determined by 
the capture oligo design) for at least three hours. Excess, non-hybridized 
RNA-protein fusions are removed by washing with 25 mM Tris-HCl (pH 8.0) 
and 50 mM potassium chloride for several minutes at the incubation 
temperature. The protein chip is subjected to two 1 5-minute washes, using a 
1 0 buffer that is more stringent and contains a lower salt concentration than the 
buffer used for the hybridization reaction. 

Example 7: Generation of an Exemplary FL AG and HA1 1 Fusion Chip 

Using the techniques essentially as described above, exemplary 
FLAG and HA1 1 fusion chips were generated as follows. 

15 For silylation of the glass microchip surface, pre-cleaned 1x3 inch 

glass microscope slides (Goldseal, #3010) were treated with Nanostrip 
(Cyantek) for 15 minutes, 10% aqueous NaOH at 70°C for 3 minutes, and 1% 
aqueous HC1 for 1 minute, thoroughly rinsing with deionized water after each 
solution. The slides were then dried in a vacuum desiccator over anhydrous 

20 calcium sulfate for several hours. A 1% solution of 

aminopropytrimethoxysilane (Gelest, Inc.) in 95% acetone / 5% water was 
prepared and allowed to hydrolyze for 20 minutes. The glass slides were 
immersed in the hydrolyzed silane solution for 5 minutes with gentle agitation. 
Excess silane was removed by subjecting the slides to ten 5-minute washes, 

25 using fresh portions of 95% acetone / 5% water for each wash, with gentle 
agitation. The slides were then cured by heating at 1 10°C for 20 minutes. 
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To derivatize with a homobifiinctional linker, the silane treated slides 
were immersed in a freshly prepared 0.2% solution o^phenylene 
1,4-diisothiocyanate (Aldrich Chemical Co.) in 90% DMF / 10% pyridine for 
two hours, with gentie agitation. The slides were washed sequentially with 
5 90% DMF / 1 0% pyridine, methanol, and acetone. After air drying, the 

functionalized slides were stored at 0°C in a vacuum desiccator over anhydrous 
calcium sulfate. 

Capture oligos were then designed and synthesized by standard 
techniques. In particular, the RNA employed to make the FLAG epitope fusion 

10 (17 amino acids total) consisted of 5'-r(UAA UAC GAC UCA CUA UAG 

GGA CAA UUA CUA UUU ACA AUU ACA AUG GAC UAC AAG GAC - 
GAU GAC GAU AAG GGC GGC UGG UCC CAC CCC CAG UUC GAG 
AAG) (SEQ ID NO: 1). The RNA employed to make the HA1 1 epitope fusion 
(20 amino acids total) consisted of 5'-r(UAA UAC GAC UCA CUA UAG 

1 5 GGA CAA UUA CUA UUU ACA AUU ACA AUG UAC CCC UAC GAC 
GUG CCC GAC UAC GCC GGC GGC UGG UCC CAC CCC CAG UUC 
GAG AAG) (SEQ ID NO: 2). In addition, in each case, the following DNA 
linker, which also contained the essential puromycin moiety at its 3-end, was 
ligated to the 3 -terminus of the RNA message: 

20 5 f -d(AAAAAAAAAAAAAAAAAAAAAAAAA^ (SEQ ID NO: 3). 

Specific, non-interacting, and thermodynamically isoenergetic 
sequences along the target RNAs were identified to serve as capture points. 
The software program HybSimulator v2.2 (Advanced Gene Computing 
Technology, Inc.) facilitated the identifcation and analysis of potential capture 

25 probes. A single specific capture probe for each RNA was ultimately identified 
(CPflag and CPhal 1). In addition, two sequences common to each RNA 
(CPt7, CPtag) were also identified to serve as positive controls. Four non-sense 
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sequences (CPaul, CPau5, CPirs, CPkt3) were generated as well to serve as 
negative controls. In total, eight unique sequences were selected. These 
oligonucleotides were prepared so that they could be attached to the chip 
surface at either the 3'- or 5'- terminus. Therefore, 16 capture probes were 
5 prepared comprising eight unique sequences. The following is a list of these 
capture probe sequences (5' to 3') (SEQ ID NOS: 4-1 1): 



CPt7: 


TGTAAATAGTAATTGTCCC 


CPtag: 


CTTCTCGAACTGGG 


CPaul: 


CCTGTAGGTGTCCAT 


CPau5: 


CAGGTAGAAGTCGGT 


CPflag: 


CATCGTCCTTGTAGTC 


CPhall: 


CGTCGTAGGGGTA 


CPirs: 


CCGCTCCTGATGTA 


CPkt3: 


TCGGGAGGCATTG. 



1 5 Oligonucleotide capture probes were chemically synthesized in the 3' 

to 5' direction by coupling standard phosphoramidite monomers using an 
automated DNA synthesizer (PE BioSystems Expedite 8909). Typically, 500 
angstrom controlled-pore glass supports were used at the 0.2 micromole scale. 
In the case of 5-attachment, after the desired probe sequence had been 

20 assembled (using A, G, C, and T monomers), four hexaethylene oxide 

phosphoramidite monomers (Glen Research) were added to the 5'-terminus. 
The coupling wait time was extended to 15 minutes by modifying the 
synthesizer program. Additional hexaethylene oxide monomer units were 
added in the same way. C-6 Amino phosphoramidite (Glen Research) was then 

25 added to the 5' terminus; the coupling wait time was again extended to 1 5 
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minutes. The acetic anhydride capping step and the final acidic detritylation 
were eliminated. In the case of 3 -attachment, oligonucleotide synthesis began 
with a control led-pore glass support bearing orthoganally protected primary 
hydroxyl and amino functionalities (Glen Research). Chain elongation began 
5 on the hydroxyl group, and the amino group remained protected during 

oligomer assembly, only being unveiled during the final deprotection. The first 
four monomers to be added were hexaethylene oxide units, followed by the 
standard A, G, C, and T monomers. All capture oligo sequences were cleaved 
from the solid support and deprotected with ammonium hydroxide, 
1 0 concentrated to dryness, precipitated in ethanol, and purified by reverse-phase 
HPLC using an acetonitrile gradient in triethylammonium acetate buffer. 
Apppropriate fractions from the HPLC were collected, evaporated to dryness in 
a vacuum centrifuge, and then coevaporated with a portion of water. 

To attach the purified, amine-labeled capture oligos, the oligos were 
15 adjusted to a concentration of 250 micromolar in 50 mM sodium carbonate 
buffer (pH 9.0) containing 10% glycerol. The oligos were then robotically 
applied (MicroGrid, BioRobotics) to the derivatized glass surface described 
above at defined positions ina5x5xl6 array pattern (384 spots) within a 20 x 
20mm area. The layout of these capture probes is shown schematically in 
20 Figure 3. A 16-pin tool was used to transfer the liquid, producing 200 micron 
features with a pitch of 600 microns. Each sub-grid of 24 spots represented a 
single capture probe (i.e., 24 duplicate spots). The array was incubated at room 
temperature in a moisture-saturated environment for 12-18 hours. The 
attachment reaction was terminated by immersing the glass surface in an 
25 aqueous 1% ammonia solution for five minutes with gentle agitation. The glass 
surface was then subjected to three 5-minute washes, using fresh portions of 
distilled water for each wash. The array was then soaked in a 10X PBS 
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(phosphate buffered saline) solution for 2 hours at room temperature, and then 
rinsed again for 5 minutes in distilled water. 

RNA-protein fusions between the peptides containing the FLAG and 
HA1 1 epitopes and their corresponding mRNAs were produced as generally 
5 described by Szostak et al., WO 98/3 1 700; and Roberts and Szostak, Proc. 
Natl. Acad. Sci. USA 94:12297-12302, 1997. The polymerase chain reaction 
using Taq polymerase (Promega) was used to amplify the sequences 5'-TAA 
TAC GAC TCA CTA TAG GGA CAA TTA CTA TTT ACA ATT ACA ATG 
GAC TAC AAG GAC GAT GAC GAT AAG GGC GGC TGG TCC CAC 

1 0 CCC CAG TTC GAG AAG (SEQ ID NO: 12) and 5'-TAA TAC GAC TCA 
CTA TAG GGA CAA TTA CTA TTT ACA ATT ACA ATG TAC CCC TAC 
GAC GTG CCC GAC TAC GCC GGC GGC TGG TCC CAC CCC CAG TTC 
GAG AAG (SEQ ID NO: 13) for FLAG and HA1 1 , respectively, using the 
oligonucleotide primers 5'-TAA TAC GAC TCA CTA TAG GGA CAA TTA 

1 5 CTA TTT ACA ATT (SEQ ID NO: 1 4) and 

5'-AGCGGATGCCTTCTCGAACTGGGGGTGGGA (SEQ ID NO: 15). The 
resulting PCR products were transcribed in vitro using T7 RNA polymerase 
(Ambion) to produce an mRNA containing the coding region for the FLAG and 
HA1 1 epitopes and the TMV untranslated region. This RNA was ligated to a 

20 DNA linker 5*- AAA AAA AAA AAA AAA AAA AAA AAA AAA CC (SEQ 
ID NO: 3) containing a 5' phosphate and a 3' puromycin by T4 DNA ligase 
(Promega) in the presence of an 80:20 mixture of the following two DNA 
splints: 5 '-TGCAACG ACC AACTTTTTTTTTTAGCGC ATGC (SEQ ID NO: 

25 17), each containing two biotin moieties at the 5' terminus. The resulting 
RNA-DNA chimera was purified by binding to Immobilized NeutiAvidin 
(Pierce), washing to remove unligated material, and eluting by displacement 
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using the sequence 5-GCATCCGCTAAAAAAAAAAGTTGGTCGTTGC 
(SEQ ID NO: 1 8). Subsequent translations were performed in rabbit 
reticulocyte lysate (Ambion) according to the manufacturer's instructions 
except that MgCl 2 (150 mM) and KC1 (425 mM) wereadded after 30 minutes 
to promote the formation of the puromycin-peptide bond. The RNA-peptide 
fusions were then purified by oligo dT affinity chromatography (Pharmacia), 
quantitated by scintillation counting of the incorporated vs. added 35 S 
methionine (Amersham), and concentrated to a low volume via membrane 
filtration (MicroCon). 

For hybridization of the fusions to the immobilized capture probes, 
aliquots of each of the FLAG and HA1 1 fusions, corresponding to 1 .0 picomole 
each, were combined and adjusted to 5X SSC (saline sodium citrate) + 0.02% 
Tween-20 in a volume of 20 microliters. The solution was applied to the glass 
chips described above, coverslips were placed on top, and the slides were 
placed in a moisture-saturated chamber at room temperature. After 18 hours 
the coverslips were removed, and the slides were washed sequentially with 
stirred 500 mL portions of IX SSC + 0.02% Tween-20, IX SSC + 0.02% 
Tween-20, and IX SSC for 5 minutes each, followed by a brief rinse with 0.2X 
SSC. After removal of liquid the slides were allowed to briefly air-dry. 

To detect hybridization, the FLAG and HA 1 1 -fusion chip was 
exposed to a phosphorimage screen (Molecular Dynamics) for 60 hours by 
direct contact between the screen and the chip. This allowed identification of 
the areas that contained hybridized fusions, since the peptides contained a 35 S 
methionine radiolabel which was detectable by the phosphor storage screen. 
As shown in Figure 4, analysis of the phosphorimage revealed that the fusions 
had successfully hybridized to their respective capture probes targeting specific 
areas of the RNA message (i.e., CPflag and CPhal 1). In addition, the four 
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non-sense capture probes, which were not compel ementary to any region of the 
FLAG or HA1 1 RNA, did not give any appreciable signal (i.e., CPaul, CPau5, 
CPirs, CPkt3). The positive control capture probe CPtag produced the 
expected signal, but the corresponding positive control capture probe CPt7 did 
5 not, likely due to degradation (e.g., exonuclease contamination) of the 5'-region 
of the targeted RNA. These results demonstrated the feasibility of addressing a 
mixture of peptides (as fusions) to specific locations on the surface of a chip. 
Both the 3'-attached capture probes and the 5'-attached capture probes were 
effective. 

10 A duplicate chip was probed with a monoclonal antibody that 

recognized the HA1 1 epitope. All of the following steps were performed at 
4°C. Nonspecific sites were first blocked with a solution containing IX PBS 
(phosphate buffered saline) + 1 % BSA (bovine serum albumin, RNAse free 
grade, Ambion) + 0.02% Tween-20 for 1 hour under a coverslip. The blocking 

15 solution was removed and 50 microliters of HA. 1 1 monoclonal antibody (100:1 
dilution, Berkeley Antibody Co.) in IX PBS + 0.02% Tween-20 was applied to 
the chip under a coverslip. After 2 hours the coverslip was removed, and the 
chip was washed with three 50mL portions of IX PBS + 0.02% Tween-20 for 5 
minutes each, with gentle agitation. Excess liquid was removed and then 50 

20 microliters of Cy3-labeled goat anti-mouse IgG (400:1 dilution, Amersham 
Pharmacia Biotech) in IX PBS + 0.02% Tween-20 was added under a 

coverslip. After 1 hour the coverslip was removed, and the chip was washed in 

i ^ - -.- 

three 50mL portions of IX PBS + 0.02% Tween-20 for 5 minutes each, with 
gentle agitation. Excess liquid was removed, and the chip was allowed to 
25 air-dry at room temperature. The chip was subsequently analyzed at 10 micron 
pixel resolution with a confocal laser scanner (ScanArray 3000, General 
Scanning) using preset excitation and emission wavelengths tuned to the Cy3 



WO 99/51773 



PCT/US99/07203 



-27- 

fluorophore. As shown in Figure 5, the resulting fluorimage was in accord with 
the phosphorimage and demonstrated that the HA1 1 peptide, which was 
covalently linked to its RNA message and fixed to the chip surface, was 
functional and was available to interact with its binding partner (the HA1 1 
monoclonal antibody). Moreover, although both the FLAG-fusion and the 
HA1 1 -fusion were presented on the chip surface, the HA1 1 monoclonal 
antibody was specific for its own epitope. In addition, the 3-attachment 
capture probes generally provided a better signal than the 5 ! -attachment capture 
probes. Without being bound to a particular theory, this may reflect the greater 
accessibility of the epitope when it is oriented away from the chip surface. 

Example 8: Generation of an Exemplary Mvc Fusion Chip 

Using the techniques essentially as described above, an exemplary 

Myc fusion chip was also generated as follows. 

For silylation of the glass surface, select grade, low-iron content, 

pre-cleaned 25 x 75mm glass microscope slides (VWR Scientific, #4831 1-950) 

were used as supplied. A 1 % solution of aminopropytrimethoxysilane (Gelest, 

"C 

Inc.) in 95% acetone / 5% water was prepared and allowed to hydrolyze for 20 
minutes. The glass slides were immersed in the hydrolyzed silane solution for 
5 minutes with gentle agitation. Excess silane was removed by subjecting the 
slides to ten 5-minute washes, using fresh portions of 95% acetone / 5% water 
for each wash, with gentle agitation. The slides were then cured by heating at 
110°Cfor 20 minutes. 

To derivatize with a homobifunctional linker, the silane treated slides 
were immersed in a freshly prepared 0.2% solution of phenylene 
1,4-diisothiocyanate (Aldrich Chemical Co.) in 90% DMF / 10% pyridine for 
two hours, with gentle agitation. The slides were washed sequentially with 
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90% DMF / 10% pyridine, methanol, and acetone. After air drying, the 
functionalized slides were stored at 0°C in a vacuum desiccator over anhydrous 
calcium sulfate. 

The capture oligos were synthesized based on the Myc sequence. In 
5 particular, the RNA employed to make the c-myc fusion (33 amino acids total) 
consisted of the following sequence: 

5'-r(UAAUACGACUC4gjAUAGGGACAAUUACUAUUUACAAUUACA 
AUGGGGACAAUUACUAUUUACAAUUACAAUGGCUGAAGAACAGA 
AACUGAUCUCUGAAGAAGACCUGCUGCGUAAACGUCGUGAACAGC 

1 0 UGAAACACAAACUGG AACAGCUGCGUAACUCUUGCGCU) (SEQ ID 
NO: 19). In addition, the following DNA linker, which also contains the 
essential puromycin moiety, was ligated to the 3 '-terminus of the RNA 
message: 5'-d(AAAAAAAAAAAAAAAAAAAAAAAAAAACC) (SEQ ID 
NO: 3). Three non-overlapping and thermodynamically isoenergetic 20-mer 

15 sequences along the RNA were identified to serve as capture points. In 

addition, dA25 (on the ligated DNA) was selected as a fourth target area. The 
targeted sequences began at nucleotide positions 1, 33, 80, and 125 (CP01, 
CP33, CP80 and CP125, respectively). A mismatch sequence, derived from 
target sequence 33 and containing four internal and adjacent nucleotide 

20 mismatches, was also designed (CPmm). A non-sense sequence, corresponding 
to the reverse-orientation of CP33, was also utilized as a negative control 
(CPns). The following is a list of the capture probe sequences that were 
employed (5' to 3') (SEQ ID NOS: 20-25): 



CP01 
25 CP33 
CP80 



TTGTAAATAGTAATTGTCCC 
AGAGATCAGTTTCTGTTCTT 
AGTTTGTGTTTCAGCTGTTC 
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CP125: TTTTTTTTTTTTTTTTTTTTTTTTT 
Cprnm: AGAGATCTCAATCTGTTCTT 
Cpns: TTCTTGTCTTTGACTAGAGA 



Oligonucleotide capture probes were chemically synthesized in the 3' 
to 5' direction by coupling standard phpsphoramidite monomers with an 
automated DNA synthesizer (PE BioSystems Expedite 8909). Typically, 500 
angstrom control led-pore glass supports were used at the 0.2 micromole scale. 
After the desired probe sequence had been assembled (using A, G, C, and T 
monomers), hexaethylene oxide phosphoramidite monomer (Glen Research) 
was added to the 5 -terminus. The coupling wait time was extended to 15 
minutes by modifying the synthesizer program. Additional hexaethylene oxide 
monomer units were added in the same way. C-6 Amino phosphoramidite 
(Glen Research) was then added to the 5* terminus; the coupling wait time was 
again extended to 15 minutes. The acetic anhydride capping step and the final 
acidic detritylation were eliminated. Capture oligo sequences were cleaved 
from the solid support and deprotected with ammonium hydroxide, 
concentrated to dryness, precipitated in ethanol, and purified by reverse-phase 
HPLC using an acetonitrile gradient in tri ethyl ammonium acetate buffer. 
Apppropriate fractions from the HPLC were collected, evaporated to dryness in 
a vacuum centrifuge, and then coevaporated with a portion of water. 

To attach these purified, amine-labeled capture oligos, the oligos 
were adjusted to a concentration of 500 micromolar in 100 mM sodium 
carbonate buffer (pH 9.0) and were applied to the derivatized glass surface at 
defined positions in a 6 x 6 array pattern (36 spots) within a 20 x 20mm area 
(as shown in Figure 6). CP01 was applied to locations Ai, Bl, CI and A4, B4, 
C4. CP33 was applied to locations Dl, El, Fl and D4, E4, F4. CP80 was 
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applied to locations A2, B2, C2 and A5, B5, C5. CP 125 was applied to 
locations D2, E2, F2 and D5, E5, F5. Cpmm was applied to locations A3, B3, 
C3 and A6, B6, C6. Cpns was applied to locations D3, E3, F3 and D6, E6, F6. 
For manual deposition, aliquots of 0.2 microliter each were applied with a 
5 pipetman. The array was incubated at room temperature in a 

moisture-saturated environment for 12-18 hours. The attachment reaction was 
terminated by immersing the glass surface in an aqueous 1% ammonia solution 
for five minutes with gentle agitation. The glass surface was then subjected to 
three 5-minute washes, using fresh portions of distilled water for each wash. 
1 0 The array was then soaked in a 1 OX PBS (phosphate buffered saline) solution 
for 2 hours at room temperature, and then rinsed again for 5 minutes in distilled 
water. 

RNA-protein fusions between a 33 amino acid peptide containing the 
c-myc epitope and its mRNA were produced as described by Szostak et al., 

1 5 WO 98/3 1 700; and Roberts-and Szostak, Proc. Natl. Acad. Sci. USA 94: 12297- 
12302, 1 997. The polymerase chain reaction using Taq polymerase (Promega) 
was used to amplify the sequence 5'-AGC GCA AGA GTT ACG CAG CTG 
TTC CAG TTT GTG TTT CAG CTG TTC ACG ACG TTT ACG CAG CAG 
GTC TTC TTC AGA GAT CAG TTT CTG TTC TTC AGC CAT (SEQ ID 

20 NO: 26) using oligonucleotide primers 5-AGC GCA AGA GTT ACG CAG 
CTG (SEQ ID NO: 27) and 5'-TAA TAC GAC TCA CTA TAG GGA CAA 
TTA CTA TTT ACA ATT ACA ATG GCT GAA GAA CAG AAA CT (SEQ 
ID NO: 28). The resulting PCR product was transcribed in vitro using T7 RNA 
polymerase (Ambion) to produce an mRNA containing the coding region for 

25 the c-myc epitope and the TMV untranslated region. This RNA was ligated to 
a DNA linker 5'- AAA AAA AAA AAA AAA AAA AAA AAA AAA CC 
(SEQ ID NO: 3) containing^ 5' phosphate and a 3' puromycin by T4 DNA 
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ligase (Promega) in the presence of a DNA splint with the sequence TTT TTT 
TTT TAG CGC AAG A (SEQ ID NO: 29). The resulting 154mer RNA-DNA 
chimera was purified by denaturing polyacrylamide gel electrophoresis (6% 
acrylamide). Translation was performed in rabbit reticulocyte lysate (Ambion) 
5 according to the manufacturer's instructions except that KC1 (500 mM) was 
added after 30 minutes to promote the formation of the puromycin-peptide 
bond. The RNA-peptide fusion was purified by oligo dT affinity 
chromatography (Pharmacia), quantitated by scintillation counting of the 
incorporated vs. added 35 S methionine (Amersham), and dried to a pellet. 2.5 

10 pmol of the c-myc fusion was produced. 

To hybridize to the capture probes, the dry myc- fusion pellet was 
taken up with 20 microliters of 5X SSC (saline sodium citrate) + 0.02% SDS, 
mixed, and then briefly centrifuged. The solution was applied to the slide 
described above, a coverslip was placed on top, and the slide was placed in a 

15 moisture-saturated chamber at room temperature. After 18 hours the coverslip 
was removed, and the slide was washed sequentially with stirred 500 mL 

portions of 5X SSC + 0.02% SDS, 2.5X SSC + 0.01% SSC, 2.5X SSC, and 

'v. 

1 .25X SSC for 5 minutes each. After removal of liquid the slide was allowed 
to briefly air-dry. 

20 To detect hybridization of the Myc fusions, the glass chip was 

exposed to a phosphorimage screen (Molecular Dynamics) for four hours by 
direct contact between the screen and the chip. This allowed identification of 
the areas that contained hybridized myc- fusion, since the myc peptide 
contained a 35 S methionine radiolabel which was detectable by the phosphor 

25 storage screen. As shown in Figure 7, analysis of the phosphorimage revealed 
that the myc-fusion had successfully hybridized to each of the four capture 
probes that targeted the myc RNA message and DNA linker sequence. In 
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addition, the non-sense capture probe, which was not complementary to any 
region of the myc RNA, did not give any appreciable signal. The capture probe 
sequence that contained several mismatches produced only a small amount of 
signal. These results demonstrated that it was possible to address a peptide (as 
5 a fusion) to a specific location on the surface of a chip. 

After phosphorimage analysis, the same chip was probed with a 
monoclonal antibody tha^recognized the c-myc epitope. All of the following 
steps were performed at 4°C. Nonspecific sites were first blocked with a 
solution containing IX PBS (phosphate buffered saline) + 1% BSA (bovine 

1 0 serum albumin, Sigma Chemical Co.) + 0. 1 unit per microliter RNAse inhibitor 
(Ambion) for 1 hour under a coverslip. The blocking solution was removed, 
and 50 microliters of 9E10 monoclonal antibody in IX PBS (400:1 dilution, 
Berkeley Antibody Co.) was applied to the chip under a coverslip. After 1 hour 
the coverslip was removed, and the chip was washed with three 50mL portions 

15 of IX PBS for 5 minutes each, with gentle agitation. Excess liquid was 

removed, and then 50 microliters of Cy3-labeled goat anti-mouse IgG in IX 
PBS (400:1 dilution, Amersham Pharmacia Biotech) was added under a 
coverslip. After 1 hour the coverslip was removed, and the chip was washed in 
three 50mL portions of IX PBS for 5 minutes each, with gentle agitation. 

20 Excess liquid was removed, and the chip was allowed to air-dry at room 

temperature. The chip was subsequently analyzed at 10 micron pixel resolution 
with a confocal laser scanner (ScanArray 3000, General Scanning) using preset 
excitation and emission wavelengths tuned to the Cy3 fluorophore. As shown 
in Figure 8, the resulting fluorimage was in accord with the phosphorimage and 

25 demonstrated that the myc peptide, which was covalently linked to its RNA 
message and fixed to the chip surface, was functional and was available to 
interact with its binding partner (the monoclonal antibody). 
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All publications and patents mentioned in this specification are 
herein incorporated by reference to the same extent as^f each individual 
publication or patent was specifically and individually indicated to be 
incorporated by reference. 

Other Embodiments 
From the foregoing description, it will be apparent that variations 
and modifications may be made to the invention described herein to adopt it to 
various usages and conditions. Such embodiments are also within the scope of 
the following claims. 

What is claimed is: 
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Claims 



1. A solid support comprising an array of immobilized capture 
probes, each of said capture probes comprising a non-nucleosidic spacer group 
and an oligonucleotide sequence to which a nucleic acid-protein fusion is 

5 bound. 

2. A solid support comprising an array of immobilized capture 
probes, wherein each of said capture probes is attached to the surface of said 
solid support through a non-nucleosidic spacer group, and wherein each of said 
capture probes comprises an oligonucleotide sequence to which a nucleic acid- 

10 protein fusion is bound. 

3. A solid support comprising an array of immobilized capture 
probes, each of said capture probes comprising a non-nucleosidic spacer group 
and an oligonucleotide sequence to which a ribosome display particle is bound. 

4. The solid support of claim 1, 2, or 3, wherein said nucleic acid- 
15 protein fusion is an RNA-protein fusion. 

5. The solid support of claim 1, 2, or 3, wherein said capture probe 
is bound to said nucleic acid-protein fusion by base pairing. 

6. The solid support of claim 1, 2, or 3, wherein said protein is 
encoded by said nucleic acid. 



20 



7. The solid support of claim 1, 2, or 3, wherein said spacer group 
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comprises a polyalkylene oxide, polyethylene oxide, or hexaethylene oxide. 

8. The solid support of claim 1, 2, or 3, wherein said capture probe 
comprises a photocleavable linker. 

9. The solid support of claim 1, 2, or 3, wherein said oligonucleotide 
5 sequence comprises a modified base, an internucleotide analog, or a 

carbohydrate modification. 

10. The solid support of claim 9, wherein said modified base is 5- 
propyne pyrimidine, said internucleotide analog is a 3-phosphoramidate 
linkage, or said carbohydrate modification is a 2-O-methyl group. 

10 11. The solid support of claim 1, 2, or 3, wherein said nucleic acid- 

protein fusion comprises a hybridization tag sequence. 

12. The solid support of claim 11, wherein said hybridization tag 
sequence comprises a modified base, an internucleotide analog, or a 
carbohydrate modification. 

15 13. The solid support of claim 1, 2, or 3, wherein said capture probe 

further comprises a reactive moiety. 

14. The solid support of claim 13, wherein said reactive moiety is a 
primary amino group. 

15. The solid support of claim 1, 2, or 3, wherein said solid support 
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is a glass or silica-based chip. 

16. The solid support of claim 1 , 2, or 3, wherein said nucleic acid- 
protein fusion is covalently linked to said capture probe. 

17. The solid support of claim 1 6, wherein said capture probe 
comprises one or more psoralen moieties. 

18. A method for preparing a solid support, said method comprising 

j 

the steps of: 

(a) preparing a capture probe by linking a spacer group to an 
oligonucleotide sequence; 

(b) attaching said capture probe to said solid support; and 

(c) binding a nucleic acid-protein fusion to said capture probe. 

t _ 

19. A method for preparing a solid support, said method comprising 
the steps of: 

(a) attaching a spacer group to a surface of said solid support; 

(b) attaching a bifunctional linker to said spacer group; 

(c) attaching a capture probe to said bifunctional linker; and 

(d) binding a nucleic acid-protein fusion to said capture probe. 

20. The methodof claim 1 8 or 19, wherein said nucleic acid-protein 
fusion is an RNA-protein fusion. 



21 . A method for detecting an interaction between a protein and a 
compound, said method comprising the steps of: 



WO 99/51773 PCT/US99/07203 

-37- 

(a) providing a solid support comprising an array of immobilized 
capture probes, each of said capture probes comprising^ non-nucleosidic 
spacer group and an oligonucleotide sequence to which a nucleic acid-protein 
fusion is bound; 

(b) contacting said solid support with a candidate compound under 
conditions which allow an interaction between the protein portion of said 
nucleic acid-protein fusion and said compound; and 

(c) analyzing said solid support for the presence of said compound as 
an indication of an interaction between said protein and said compound. 

22. A method for detecting an interaction between a protein and a 
compound, said method comprising the steps of: 

(a) providing a population of nucleic acid-protein fusions; 

(b) contacting said population of nucleic acid-protein fusions with a 
candidate compound under conditions which allow an interaction between the 
protein portion of said nucleic acid-protein fusion and said compound; 

(c) contacting the product of step (b) with a solid support comprising 
an array of immobilized capture probes, each of said capture probes comprising 
a non-nucleosidic spacer group and an oligonucleotide sequence to which a 
nucleic acid-protein fusion binds; and 

(d) analyzing said solid support for the presence of said compound as 
an indication of an interaction between said protein and said compound. 

23. The method of claim 21 or 22, wherein said nucleic acid-protein 
fusion is an RNA-protein fusion. 

24. The method of claim 21 or 22, wherein said compound is 



WO 99/51773 



PCT/US99/07203 



-38- 

labeled. 

25. The method of claim 21 or 22, wherein said compound is a 
protein, a therapeutic, an enzyme, or a nucleic acid. 

26. An array of nucleic acid-protein fusions, said array comprising 
5 at least 10 2 different fosions/cm 2 . 

27. The array of claim 26, wherein said array comprises at least 10 4 
different fusions/cm 2 . 

28. The array of claim 26, wherein said nucleic acid-protein fusions 
are RNA-protein fusions. 

10 29. A method 'for generating an addressable array of molecules, said 

method comprising: 

(a) providing a solid support on which an array of nucleic acid 

molecules is immobilized; 

(b) contacting said solid support with a population of addressable 

15 molecules; and 

(c) allowing said addressable molecules to orient themselves on said 
solid support by sequence-dependent recognition and binding of said 

immobilized nucleic acid molecules. 

i ... 

30. The method of claim 29, wherein said addressable array of 
20 molecules is an array of nucleic acid-protein fusions. 
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31. The method of claim 30, wherein said nucleic acid-protein 
fusions are RNA-protein fusions. m 

32. The method of claim 29, wherein said sequence-dependent 
recognition and binding comprises base pairing. 

33. The method of claim 29, wherein said solid support is a glass or 
silica-based chip. 

34. The method of claim 29, wherein said nucleic acid molecules 
immobilized on said solid support are capture probes, each comprising a non- 
nucleosidic spacer group and an oligonucleotide sequence to which said 
addressable molecule binds. 
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SEQUENCE LISTING 

< 11 0 > Phlyos , Inc . 

<12 0> ADDRESSABLE PROTEIN ARRAYS 



<130> 50036/009W02 

<150> 60/080,686 
<151> 1998-04-03 

<160> 2 9 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 99 

<212> RNA " 
<213> Artificial Sequence 



<220> 

<22 3> Oligonucleotide employed to construct FLAG epitope 
fusion 

<400> 1 

uaauacgacu cacuauaggg acaauuacua uuuacaauua caauggacua caaggacgau 60 
gacgauaagg gcggcugguc ccacccccag uucgagaag 99 

L .... ■'■ ,a 

<2io> 2 . ■ ' ' 

<211> 102 [ . <• f 

<212> RNA 

<213> Artificial Sequence 

<22 0> ;. : 
<223> Oligonucleotide employed to construct HA11 epitope 
fusion 

<400> 2 

uaauacgacu cacuauaggg acaauuacua uuuacaauua caauguaccc cuacgacgug 6 0 

cccgacuacg ccggcggcug gucccacccc caguucgaga ag 102 

<210> 3 

<211> 2 9 * ^) 

<212> DNA 

<213> Artificial Sequence. 
<220> 

<223> Oligonucleotide used for attaching puromycin 
<400> 3 

aaaaaaaaaa aaaaaaaaaa aaaaaaacc 29 



<210 1 
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2 

<211> 19 
<212> DNA 

<213> Artificial Sequence 

<220> *% 
<223> Oligonucleotide used for chip attachment 

<400> 4 
tgtaaatagt aattgtccc 

<210> 5 

<211> 14 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide used for chip attachment 

<400> 5 
cttctcgaac tggg 

<210> 6 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide used for chip attachment 

<400> 6 
cctgtaggtg tccat 

<210> 7 
<211> 15 

<212> DNA \. 
<213> Artificial Sequence 

<220> 

<223> Oligonucleotide used for chip attachment 

<400> 7 
caggtagaag tcggt 

<210> 8 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide used for chip attachment 

<400> 8 
catcgtcctt gtagtc 
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<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide used for chip attachment 
<400> 9 

cgtcgtaggg gta 13 

<210> 10 
<211> 14 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide used for chip attachment 
<4 00> 10 

ccgctcctga tgta 14 

<210> 11 
<211> 13 
<212> DNA 

<213> Artificial Sequence 

<220> i 
<223> Oligonucleotide used for chip attachment 

<4 00> 11 l _ 

tcgggaggca ttg 13 

<210> 12 
<211> 99 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> FLAG amplification sequence 



<400> 12 

taatacgact cactataggg acaattacta tttacaatta caatggacta caaggacgat 60 
gacgataagg gcggctggtc ccacccccag ttcgagaag 99 

<210> 13 i ^ 

<211> 102 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> HA11 amplification sequence 
<400> 13 

taatacgact cactataggg acaattacta tttacaatta caatgtaccc ctacgacgtg 60 
cccgactacg ccgg: ggctg gtcccacccc cagttcgaga ag 102 
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<210> 14 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide used for PCR 
<400> 14 

taatacgact cactataggg acaattacta tttacaatt 39 

<210> 15 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide used for PCR 

<400> 15 

agcggatgcc ttctcgaact gggggtggga 3 0 

<210> 16 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide used as a splint containing biotin 
moiety at 5' terminus 

<400> 16 

tgcaacgacc aacttttttt tttagcgcat gc 32 

<210> 17 : i; 

<211> 32 

<212> DNA - 

<213> Artificial Sequence 

<220> 

<223> Oligonucleotide used as a splint containing biotin 
moiety at 5 ' terminus 

<400> 17 

tgcaacgacc aacttttttt ttnagcgcat gc 32 

<210> 18 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oigonucleotide used for elution displacement 



<400> 18 
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gcatccgcta aaaaaaaaag ttggtcgttg c 

<210> 19 

<211> 169 

<212> RNA 

<213> Artificial Sequence 



5 

31 



<220> 

<223> Oligonucleotide used to make c-myc fusion 
<400> 19 

uaauacgacu cacuauaggg acaauuacua uuuacaauua caauggggac aauuacuauu 60 
uacaauuaca auggcugaag aacagaaacu gaucucugaa gaagaccugc ugcguaaacg 120 
ucgugaacag cugaaacaca aacuggaaca gcugcguaac ucuugcgcu 169 

L-.J .... 

<210> 20 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

A: 

<220> 

<22 3> Capture probe sequence 
<400> 20 

ttgtaaatag taattgtccc 20 

<210> 21 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Capture probe sequence 

■* 

<400> 21 

agagatcagt ttctgttctt 2 0 

<210> 22 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Capture probe sequence 
<400> 22 

agtttgtgtt tcagctgttc " " 2 0 

<210> 23 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Capture probe sequence 
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<400> 23 

tttttttttt tttttttttt ttttt 25 
<210> 24 

<211> 20 * 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Capture probe sequence 
<400> 24 

agagatctca atctgttctt 20 

<210> 25 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Capture probe sequence 

<400> 25 

ttcttgtctt tgactagaga 20 

<210> 26 
<211> 99 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> c-myc epitope amplification sequence 
<400> 26 

agcgcaagag ttacgcagct gttccagttt gtgtttcagc tgttoacgac gtttacgcag 60 
caggtcttct tcagagatca gtttctgttc ttcagccat ^ 99 

<210> 27 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide used for PCR 
<400> 27 

agcgcaagag ttacgcagct g 21 

<210> 28 
<211> 62 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Oligonucleotide used for PCR 
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<400> 28 

taatacgact cactataggg ' acaattacta tttacaatta caatggctga agaacagaaa 60 
ct 62 

<210> 29 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide used as a splint 



<400> 29 
tttttttttt agcgcaaga 
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